On the application of estimation of distribution algorithms to multi-marker tagging SNP selection
Abstract
This paper presents an algorithm for the automatic selection of a minimal subset of tagging single nucleotide polymorphisms (SNPs) using an estimation of distribution algorithm (EDA). The EDA stochastically searches the constrained space of feasible solutions and exploits the topological structure defined by the SNP correlations to model the interactions in the problem. The algorithm is evaluated across the HapMap reference panel data sets. It is effective at identifying minimal multi-marker SNP sets, which are considerably smaller than the corresponding single-marker tagging sets. New reduced tagging sets are obtained for all the HapMap SNP regions considered. We also show that information extracted from the interaction graph representing the correlations between SNPs can help improve the efficiency of the optimization algorithm.
Keywords: SNPs, tagging SNP selection, multi-marker selection, estimation of distribution algorithms, HapMap.
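As a rough illustration of the kind of search the abstract describes, the Python sketch below runs a univariate EDA (UMDA) over binary selection vectors and repairs each sample so that it stays feasible. It is not the paper's algorithm. It assumes single-marker coverage (every SNP must have r² at or above a threshold with at least one selected tag) rather than multi-marker tagging, it uses independent per-SNP probabilities instead of a model built from the interaction graph, and the function names, the 0.8 threshold, and the greedy repair step are all hypothetical choices made for illustration.

```python
import random


def covered(sel, r2, thr):
    """For each SNP j, is some selected tag t correlated with it at r2 >= thr?"""
    m = len(r2)
    return [any(sel[t] and r2[t][j] >= thr for t in range(m)) for j in range(m)]


def repair(sel, r2, thr, rng):
    """Make a candidate feasible, then prune tags that are not needed.

    Assumes r2[t][t] == 1.0 and thr <= 1, so every SNP can tag itself.
    """
    m = len(r2)
    sel = list(sel)
    cov = covered(sel, r2, thr)
    while not all(cov):
        # Greedy step: add the SNP that covers the most uncovered SNPs.
        gain = lambda t: sum(1 for j in range(m) if not cov[j] and r2[t][j] >= thr)
        sel[max(range(m), key=gain)] = 1
        cov = covered(sel, r2, thr)
    for t in rng.sample(range(m), m):  # random order keeps the search diverse
        if sel[t]:
            sel[t] = 0
            if not all(covered(sel, r2, thr)):
                sel[t] = 1  # this tag was needed, put it back
    return sel


def umda_tag_selection(r2, thr=0.8, pop_size=60, n_gen=40, trunc=0.5, seed=0):
    """Univariate EDA: sample, repair, select the smallest sets, re-estimate."""
    rng = random.Random(seed)
    m = len(r2)
    p = [0.5] * m  # p[j] = probability that SNP j is chosen as a tag
    best = None
    for _ in range(n_gen):
        pop = [
            repair([1 if rng.random() < p[j] else 0 for j in range(m)], r2, thr, rng)
            for _ in range(pop_size)
        ]
        pop.sort(key=sum)  # fewer tags is better
        if best is None or sum(pop[0]) < sum(best):
            best = pop[0]
        elite = pop[: max(2, int(trunc * pop_size))]
        # Re-estimate the marginals from the elite, clamped to avoid fixation.
        p = [
            min(0.95, max(0.05, sum(ind[j] for ind in elite) / len(elite)))
            for j in range(m)
        ]
    return [j for j in range(m) if best[j]]
```

Called on a toy r² matrix made of two blocks of three perfectly correlated SNPs, `umda_tag_selection` returns one tag index from each block. On real data the matrix would have to be computed from genotype or haplotype data first, which this sketch leaves out.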
Similar resources
Multi-objective Measurement Devices Allocation Using State Estimation in Distribution System
Allocating measurement devices is a necessity in distribution systems and is an application of state estimation. In this paper, the problem of allocating active and reactive measurement devices is modeled using a multi-objective method. The objectives of the problem are to minimize the use of measurement devices, increase in state estimation output, improve the state estimation quality and reduce cos...
Application of single-nucleotide polymorphism (SNP) as a molecular marker in the study of genetic diversity of aquatic populations
Genetic diversity is one of the important and essential characteristics of any population for its survival. The study of genetic variation in different populations of aquatic organisms is of particular importance in order to protect, stabilize and manage their stocks. Based on studies conducted in recent years, molecular markers have proven that they can be used as indicators of the genetic div...
The Pattern of Linkage Disequilibrium in Livestock Genome
Linkage disequilibrium (LD) is the basis of genomic selection, genomic marker imputation, marker assisted selection (MAS), quantitative trait loci (QTL) mapping, parentage testing and whole genome association studies. Particular alleles at close loci have a tendency to be co-inherited. In linked loci this pattern leads to an association between alleles in the population, which is known as LD. Two metr...
An Adapted Non-dominated Sorting Algorithm (ANSA) for Solving Multi Objective Trip Distribution Problem
Trip distribution deals with the estimation of trips distributed among origins and destinations and is one of the important stages in transportation planning. Since real-world trip distribution models often have more than one objective, multi-objective models are developed to cope with a set of conflicting goals in this area. In a proposed method of adapted non-dominated sorting algorithm (ANS...
Comparing Different Marker Densities and Various Reference Populations Using Pedigree-Marker Best Linear Unbiased Prediction (BLUP) Model
For genomic selection to be applied successfully, the reference population and marker density should be chosen properly. The purpose of this study was to investigate the accuracy of genomic estimated breeding values at low (5K), intermediate (50K) and high (777K) densities in simulated populations, when different scenarios were applied for selecting the reference populations. Af...
Publication date: 2009